
TIIS (Korean Society for Internet Information)


Title: Towards Effective Entity Extraction of Scientific Documents using Discriminative Linguistic Features
Authors: Sangwon Hwang, Jang-Eui Hong, Young-Kwang Nam
Citation: Vol. 13, No. 3, pp. 1639-1658 (March 2019)
English Abstract
Named entity recognition (NER) is an important technique for improving the performance of data mining and big data analytics. In previous studies, NER systems have been employed to identify named entities using statistical methods based on prior information or linguistic features; however, such methods are limited in that they are unable to recognize unregistered or unlearned objects. In this paper, a method is proposed to extract objects, such as technologies, theories, or person names, by analyzing the collocation relationship between certain words that simultaneously appear around specific words in the abstracts of academic journals. The method is executed as follows. First, the data is preprocessed using data cleaning and sentence detection to separate the text into single sentences. Then, part-of-speech (POS) tagging is applied to the individual sentences. After this, the appearance and collocation information of the other POS tags is analyzed, excluding the entity candidates, such as nouns. Finally, an entity recognition model is created based on analyzing and classifying the information in the sentences.
Keywords: Named entity recognition, entity extraction, data mining, data cleaning, sentence segmentation, information extraction
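
The first three steps described in the abstract (cleaning and sentence detection, POS tagging, and collecting the non-noun POS tags that appear around each entity candidate) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the toy lexicon, the rule that every unknown word is a noun candidate, the window size of 2, and all function names are assumptions made for illustration. A real system would use a trained POS tagger and would feed the resulting profiles to a classifier, which is omitted here.

```python
import re
from collections import Counter

# Toy lexicon standing in for a real POS tagger (assumption, illustration only).
# Any word not listed here is treated as a NOUN, i.e. an entity candidate.
TOY_LEXICON = {
    "we": "PRON", "propose": "VERB", "uses": "VERB", "called": "VERB",
    "is": "VERB", "a": "DET", "the": "DET", "on": "ADP", "by": "ADP",
}


def clean(text):
    """Data cleaning: collapse runs of whitespace into single spaces."""
    return re.sub(r"\s+", " ", text).strip()


def split_sentences(text):
    """Naive sentence detection: split after '.', '!' or '?'."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text) if s]


def tag(sentence):
    """Assign a POS tag to each alphabetic token using the toy lexicon."""
    tokens = re.findall(r"[A-Za-z][A-Za-z-]*", sentence)
    return [(tok, TOY_LEXICON.get(tok.lower(), "NOUN")) for tok in tokens]


def context_profiles(tagged, window=2):
    """For each noun candidate, count the non-noun POS tags within the window.

    Keys have the form '<offset>:<tag>', e.g. '-1:VERB' means a verb
    directly before the candidate.
    """
    profiles = {}
    for i, (tok, pos) in enumerate(tagged):
        if pos != "NOUN":
            continue
        lo = max(0, i - window)
        hi = min(len(tagged), i + window + 1)
        ctx = Counter(
            f"{j - i:+d}:{tagged[j][1]}"
            for j in range(lo, hi)
            if j != i and tagged[j][1] != "NOUN"  # exclude other candidates
        )
        profiles.setdefault(tok, Counter()).update(ctx)
    return profiles


def profile_text(text, window=2):
    """Run the pipeline sentence by sentence and merge the profiles."""
    merged = {}
    for sentence in split_sentences(clean(text)):
        for tok, ctx in context_profiles(tag(sentence), window).items():
            merged.setdefault(tok, Counter()).update(ctx)
    return merged
```

Calling `profile_text` on a short abstract returns, for each candidate word, a counter of the POS tags seen at each relative position. Profiles of this kind are the sort of collocation feature the abstract describes as input to the entity recognition model.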